VDub: Modifying Face Video of Actors for Plausible Visual Alignment to a Dubbed Audio Track

Authors

Pablo Garrido

Levi Valgaerts

H. Sarmadi

I. Steiner

Kiran Varanasi

Patrick Pérez

Christian Theobalt

Abstract

In many countries, foreign movies and TV productions are dubbed, i.e., the original voice of an actor is replaced with a translation that is spoken by a dubbing actor in the country’s own language. Dubbing is a complex process that requires specific translations and accurately timed recitations such that the new audio at least coarsely adheres to the mouth motion in the video. However, since the sequences of phonemes and visemes in the original and the dubbing language differ, the video-to-audio match is never perfect, which is a major source of visual discomfort. In this paper, we propose a system to alter the mouth motion of an actor in a video so that it matches the new audio track. Our method builds on high-quality monocular capture of 3D facial performance, lighting and albedo of the dubbing and target actors, and uses audio analysis in combination with a space-time retrieval method to synthesize a new photo-realistically rendered and highly detailed 3D shape model of the mouth region to replace the target performance. We demonstrate plausible visual quality of our results compared to footage that has been professionally dubbed in the traditional way, both qualitatively and through a user study.

Similar resources

Vodcast: A Breakthrough in Developing Incidental Vocabulary Learning

Incidental vocabulary learning is often seen as superior to direct instruction on many occasions. Meanwhile, upon the emergence of the World Wide Web, second language (SL) learners have been introduced to 'podcasts' (recorded audio and video online broadcasts) which could be authentic sources of vocabulary learning. The relatively recent phenomenon of video podcast (vodcast) might be considered...

Summarizing Audiovisual Contents of a Video Program

In this paper, we focus on video programs that are intended to disseminate information and knowledge, such as news, documentaries, seminars, etc., and present an audiovisual summarization system that summarizes the audio and visual contents of the given video separately and then integrates the two summaries with a partial alignment. The audio summary is created by selecting spoken sentences tha...

Comparison of the effectiveness of progressive relaxation, relaxation combined with rhythmic photic stimulation, and relaxation combined with rhythmic auditory stimulation on the heart rate and blood pressure of university students

Background and purpose: The aim of this research was to compare the efficacy of relaxation, relaxation combined with periodic visual stimulation, and relaxation combined with periodic audio stimulation on the blood pressure and heart rate of university students. Materials and methods: This experimental study was conducted with 36 psychology students at Allameh Tabatabaee University. The students were randomly selected and...

Visual contribution to the multistable perception of speech.

The multistable perception of speech, or verbal transformation effect, refers to perceptual changes experienced while listening to a speech form that is repeated rapidly and continuously. In order to test whether visual information from the speaker's articulatory gestures may modify the emergence and stability of verbal auditory percepts, subjects were instructed to report any perceptual change...

Video Summarization Based on Balanced AV-MMR

Among video processing techniques, video summarization is a promising approach to processing multimedia content. In this paper we present a novel summarization algorithm, Balanced Audio Video Maximal Marginal Relevance (Balanced AV-MMR or BAV-MMR), for multi-video summarization based on both audio and visual information. Balanced AV-MMR exploits the balance between audio information and ...

Journal:

Comput. Graph. Forum

Volume 34

Publication date: 2015
